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Abstract Infinite games with imperfect information are known to be undecidable 
unless the information flow is severely restricted. One fundamental decidable case 
occurs when there is a total ordering among players, such that each player has 
access to all the information that the following ones receive. In this paper we 
consider variations of this hierarchy principle for synchronous games with perfect 
recall, and identify new decidable classes for which the distributed synthesis prob¬ 
lem is solvable with finite-state strategies. In particular, we show that decidability 
is maintained when the information hierarchy may change along the play, or when 
transient phases without hierarchical information are allowed. Finally, we interpret 
our result in terms of distributed system architectures. 

Keywords Infinite games ■ Imperfect information • Coordination • Distributed 
systems • Automated synthesis 

Mathematics Subject Classification (2000) 91A06 ■ 68M14 • 93B50 


1 Introduction 


To realise systems that are correct by design is a persistent ambition in computing 
science. The stake is particularly high for systems that interact with an unpre¬ 
dictable environ ment over indeterminate tim e. Pione ering result s in the area of 
synthesis, due to iBiichi and Landweberl 1 1969lh and iRabln ( 1972f) . show that the 
task can be automatised for the case of monolithic designs with correctness con¬ 
ditions specified by automata over infinite objects — words or trees representing 
computations. A most natural framework for representing and solving the prob¬ 
lem is in ter ms of infinite games with perfe ct information o ver finite graphs, as 
described bv lPnueli and Rosneil ( 199(1 ) or by EE oma sl (|l995h . 

For distributed systems in which multiple components interact with the ob¬ 
jective of satisfying a global specification, the game-theoretical formulation of the 


D. Berwanger, A. B. Mathew, and M. van den Bogaard 
LSV, CNRS & Universite Paris-Saclay, France 
E-mail: dwb@lsv.fr 
























2 


D. Berwanger, A.B. Mathew, M. van den Bogaard 


synthesis problem leads to games with imperfect information and to the ques¬ 
tion of whether there exists a winning strategy that can be distributed among 
the multiple players. Unfortunately, such games are m u ch les s amenable to auto¬ 
mated solutions: as pointed out bv IPeterson and Reid ( 19791) . it is generally un- 
decidable whether a solution —that is, a distributed winning strategy— exists 
for a finitely presented ga me for two players against Nature (or the environment); 
furthermore. 1 Jaiiinl ( 2007lf showed that, even if a solution exists, it may not be im- 
plementable by a finite-state device. As there is no hope for solving the distributed 
synthesis problem uniformly, it remains to look out for classes that allow for an 
al gorithmic treatment . For surveys on results in this d irectio n, see, e.g ., the a rticle 
of lGastin et al.l ( 20091) or the theses of lSchewel ( 20081) and of IPiicliaTa ( 20131) . 

One fundamental case in which the distributed synthesis problem becomes de¬ 
cidable is that of hierarchical systems: these correspond to games where there 
exists a total order among the players such that, informally speaking, each player 
has access to the infor mation received by the players that come later in the 
order. For such games, IPeterson and Reil ( 19791) showed that it is decidable — 
although, with nonelementary complexity—whether distributed winning strate¬ 
gies exist and if so, finite-state winni n g strategi e s can be effectively synthesised. 
The result was extended by IPnueli and Rosner] ( 199 0) to the framework of dis¬ 
tributed system architectures with linear-time specifications over path-shaped 
comm unication architectures where information can flow only in one direction. 
Later, iKupferman and Vardil ( 2001 ) developed an automata-theoretic approach 
that allows to extend the decidability result from linear-time to branching-time 
specifications, and also relaxes some of the syntactic rest rictions i mp osed by the 
fixed- architecture setting of Pnueli and Rosner. Finally, iFinkbeiner and Schewel 
( 2005 ) gave an effective characterisation of communication architectures on which 
distributed synthesis is decidable. The criterion requires absence of information 
forks, which implies a hierarchical order in which processes, or players, have ac¬ 
cess to the observations emitted by the environment. 

The setting of games is more liberal than that of architectures with fixed com¬ 
munication channels. A rather general, though non-effective condition for games 
to admit finite-state distributed winning strategies is given in [Berwanger et all 
( 20 111 ) , based on epistemic models representing the knowledge acquired by players 
in a game with perfect recall. This condition suggests that, beyond the fork-free 
architecture classification there may be further natural classes of games for which 
the distributed synthesis problem is decidable. 

In this paper, we study a relaxation of the hierarchical information pattern 
underlying the basic decidability results on games with imperfect information and 
distributed system architectures. Firstly, we extend the assumption of hierarchical 
observation, that is, positional information, by incorporating perfect recall. Rather 
than requiring that a player observes the signal received by a less-informed player, 
we require that he can infer it from his observation of the play history. It can 
easily be seen that this gives rise to a decidable class, and it is likely that previous 
authors had a perfect-recall interpretation in mind when describing hierarchical 
systems, even if the formal definitions in the relevant literature generally refer to 
observations. 

Secondly, we investigate the case when the hierarchical information order is not 
fixed, but may change dynamically along the play. This allows to model situations 
where the schedule of the interaction allows a less-informed player to become more 
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informed than others, or where the players may coordinate on designating one to 
receive certain signals, and thus become more informed than others. We show that 
this condition of dynamic hierarchical observation also leads to a decidable class 
of the distributed synthesis problem. 

As a third extension, we consider the case where the condition of hierarchical 
information (based on perfect recall) is intermittent. That is, along every play, 
it occurs infinitely often that the information sets of players are totally ordered; 
nevertheless, there may be histories at which incomparable information sets arise, 
as it is otherwise typical of information forks. We show that, at least for the case 
of winning conditions over attributes observable by all players, this condition of 
recurring hierarchical observation is already sufficient to ensure decidability of the 
synthesis problem, and that finite-state winning strategies exist for all solvable 
instances. 

For all the three conditions of hierarchical information, it is decidable with rela¬ 
tively low complexity whether they hold for a given game. However, the complexity 
of solving a game is nonelementary in all cases, as they are mo re gener a l than the 
condit ion of hierarchical observation, for which it was shown by [Peterson and Reill 
(1979) that there exists no elementary solution procedure. 

The last part of the paper presents an interpretation of the game-theoretic 
results in terms of distributed reactive s ytems . Towards this, we extend the frame¬ 
work introduced bv IPnueli and Rosneri ( 19901 ) which features hard-wired commun- 
ciation graphs, to a model where the global actions are transduced into signals for 
the individual processes by a deterministic finite-state monitor. In this framework 
of monitored architectures, we identify classes that correspond to games with hi¬ 
erarchical information and therefore admit an effective solution of the distributed 
synthesis problem. 


2 Preliminaries 


2.1 Games on graphs 


We use the standard mod el of concurrent games with imperfect information, fol¬ 
lowing the notation from iBerwaneer et al. (2011’). There is a set N = {1,... ,n} 
of players and a distinguished agent called Nature. We refer to a list of elements 
x = (x l )i£]\r, one for each player, as a profile. For each player i, we fix a set A 1 of 
actions and a set B l of observations', these are finite sets. 

A game graph G = (V,E, consists of a set V of nodes called positions, 

an edge relation E C V x AxV representing simultaneous moves labelled by action 
profiles, and a profile of observation functions : V —>• B l that label every position 
with an observation for each player. We assume that the graph has no dead ends, 
that is, for every position v £ V and every action profile a £ A, there exists an 
outgoing move (v, a, w) £ E, and we denote the set of successors of a position v by 
vEA := {ui \(v,a, w) for some a £ A}. 

Plays start at a designated initial position vo £ V and proceed in rounds. In a 
round at position v, each player i chooses simultaneously and independently an 
action a 1 £ A 1 , then Nature chooses a successor position v' reachable along a move 
{v,a,v') £ E. Now, each player i receives the observation /3‘(v'), and the play 
continues from position v'. Thus, a play is an infinite sequence iv = vq, v\, V 2 , ■ ■ ■ 
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of positions, such that for all i > 0, there exists a move (vt,a,V£+ 1 ) £ E. A history 
is a nonempty prefix n = vo, v\,..., vp of a play; we refer to £ as the length of the 
history, and we denote by Hist(G) the set of all histories in the game graph G. 
The observation function extends from positions to histories^ and plays as /3 l (n) = 
P'(vi)/3 1 (V 2 ) ■ ■ ■, and we write Hist*(G) := \ n £ Hist(G)} for the set of 

observation histories of player i. We say that two histories n,n' are indistinguishable 
to player i. and write 7 r ~ l n', if they yield the same observation /3 l (n) = p l (jr'). 
This is an equivalence relation, and its classes are called information sets. The 
information set of player i at history n is P*( 7 r) := JV_G H ist(G) | n' 7 r|. In 
terms of the taxonomy for distributed systems bv lHalpern and Vardil (1989), our 
model is synchronous and of perfect recall. 

A strategy for player i is a mapping s l : V* —> A 1 from histories to actions that 
is information-consistent in the sense that s l (n) = s l (n'), for any pair n ~ l 7 r of 
indistinguishable histories. We denote the set of all strategies of player i by S l 
and the set of all strategy profiles by S. A history or play n = vo, vi,... follows 
the strategy s l £ S l if, for every £ > 0, we have (v£,a,vg + i) £ E for some action 
profile a where a 1 = s l (vo,vi,... ,vg). The play 7 r follows a strategy profile s £ S 
if it follows all component strategies s’ 1 . The set of possible outcomes of a strategy 
profile s is the set of plays that follow s. 

A winning condition over a game graph G is a set W C V u of plays. A distributed 
game G = (G,W) is described by a game graph and a winning condition. We say 
that a play 7 r is winning in Q if n £ W. A strategy profile s is winning in Q if 
all its possible outcomes are so. In this case, we refer to s as a distributed winning 
strategy. We generally assume that the game graph is finite or finitely presented. 
The general distributed synthesis problem asks whether for a given game graph and 
a fixed or given winning condition, there exists a distributed winning strategy. 


2.2 Automata 


Our focus is on finitely-represented games, where the game graphs are finite and 
the winning conditions described by finite-state automata. Specifically, winning 
conditions are given by a colouring function 7 : V —> C and an w-regular set W C 
G“ describing the set of plays vo, vi,... with 7 (^ 0 ), 7 ( 1 ) 1 ), • • • £ W. In certain cases, 
we assume that the colouring is observable to each player i, that is, /3* (v) P l (v') 
whenever 7 (n) 'y( v'). For gener al background on automata for games, we refer 

to the handbook bv lGradel et al.l ( 2002 i) . 

Strategies shall also be represented as finite-state machines. A Moore machine 
over an input alphabet E and an output alphabet T is described by a tuple 
(A/, mo, p, v) consisting of a finite set M of memory states with an initial state mo, a 
memory update function p : M x E —¥ M and an output function i/:Jlf-tr defined 
on memory states. Intuitively, the machine starts in the initial memory state mo, 
and proceeds as follows: in state m, upon reading an input symbol x £ E, it up¬ 
dates its memory state to m' := p(m, x) and then outputs the letter v(m). Formally, 
the update function p is extended to input words in E* by setting, p(e) := mo, 
for the empty word, and by setting, p{xo ■ ■ ■ X£-\Xg) := p{ p(xo ■ ■ ■ 2 ^- 1 ), 27 ), for 


1 Note that we discard the observation at the initial position; this is technically convenient 
and does not restrict the model. 
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all nontrivial words xo ■ ■ .xg_\Xg. This gives rise to the function M : E* —» F* 
implemented by M , defined by M(x o,..., xg) := v(ftx o ... xg )). A strategy automa¬ 
ton for player i on a game graph G, is a Moore machine M with input alpha¬ 
bet B l and output alphabet A 1 . The strategy implemented by M is defined as 
s l (y o,..., vg) := M(P z {yo ■ ■ ■ vg)), for all t > 0. A finite-state strategy is one that can 
be implemented by a strategy automaton. 

Sometimes it is convenient to refer to Mealy machines rather than Moore ma¬ 
chines. These are finite-state machines of similar format, with the only difference 
that the output function v : M x E —¥ T is defined on transitions rather than their 
target state. 

In the following we will refer to several classes C of finite games, always as¬ 
suming that winning conditions are given as ^-regular languages. The finite-state 
synthesis problem for a class C is the following: Given a game Q € C, 

(i) decide whether Q admits a finite-state distributed winning strategy, and 

(ii) if yes, construct a profile of finite-state machines that implements a dis¬ 
tributed winning strategy for Q. 

We refer to the set of distributed (finite-state) winning strategies for a given game 
Q as the (finite-state) solutions of Q. We say that the synthesis problem is finite- 
state solvable for a class C if every game Q £ C that admits a solution also admits 
a finite-state solution, and if the above two synthesis tasks can be accomplished 
for all instances in C. 


3 Static Information Hierarchies 


3.1 Hierarchical observation 


We set out from the basic pattern of hierarchical information underlying the de¬ 
cidability results cited in the introduction. These results rely on a positional in¬ 
terpretation of information, i.e., on observations. 

Definition 1 A game graph yields hierarchical observation if there exists a total 
order A among the players such that whenever i A j, then for all pairs v,v' of 
positions, ft(v) = /3 l {v') implies /3 J (v) = ft (v r ) 

In other words, if i A j, then the observation of player i determines the obser¬ 


vation^ pl ayer j . An examp le of such a situation is illustrated in Figure 1(a) 
IPeterson and Rein (197SJ) study a game with players organised in a hierarchy, 
such that each player i sees the data observed by player i — 1. The setting is 
actually generic for games with reachability winning conditions and the authors 
show that winning strategies can be synthesised in n-fo l d exp onential time and 
this complexity is unavoidable. Later. IPnueli and Rosnen ( 199011 consider a similar 
model in the context of distributed systems with linear-time specifications given 
by finite automata on infinite words. Here, the hierarchical organisation is repre¬ 
sented by a pipeline architecture which allows each process to send signals only 
to the following one. The authors show that the distributed synthesis problem for 
such a system, is solvabl e via a n automata-theoret ic technique. This technique 
is further extended by iKupferman and Vardil ( 200ll) to more general, branching¬ 
time specifications. The key operation of the construction is that of widening - 
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(b) static hierarchical information 


Fig. 1 Basic patterns of hierarchical information: game positions show the observation of 
player 1 (left) and player 2 (right); the name of the position (middle) is unobservable 


a finite-state interpretation of strategies for a less-informed player j within the 
strategies of a more-informed player i -< j. This allows to first solve a game as if 
all the moves were performed by the most-informed player, which comes first in 
the order X, and successively discard solutions that cannot be implemented by the 
less-informed players, i.e., those which involve strategies that are not in the image 
of the widening interpretation. 

The automata-th eoretic method for s olving the synthesis problem on pipelin e 
architectures, due to lPnueli and Rosner s ( 199(1 ) an d Kupferman and Vardi ([2001), 
can be adapted directly to solve the synthesis problem for games with hierarchi¬ 
cal observation. Alternatively, the solvability result follows from the reduction of 
games with hierarchical observations to pipeline architectures presented in Theo¬ 
rem [20] as a part of our discussion on games and architectures. 


Theorem 2 I Pnueli and Rosnerl ( 1990h : iKupferman and Vardi] ( 200 il l) For 

games with hierarchical observation, the synthesis problem is finite-state solvable. 


3.2 Incorporating perfect recall 

In a first step, we extend the notion of hierarchical information to incorporate 
the power of perfect recall that players have. While maintaining the requirement 
of a fixed order, we now ask that the information set of a player determines the 
information sets of those who follow in the order, as in the example of Figure [T(b)[ 

Definition 3 A game graph yields (static) hierarchical information if there exists 
a total order X among the players such that, for all histories tt, if i X j , then 
P^n) C P j (n). 

The following lemma provides an operational characterisation of the condition. 
We detail the proof, as its elements will be used later. 
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Lemma 4 A game graph G yields static hierarchical information if, and only if, for 
every pair i A j of players, there exists a Moore machine that outputs ft (7r) on input 
ft (ft, f or every history n in G. 

Proof For an arbitrary game graph G, let us denote the relation between the 
observations of two players i and j along the histories in G by 

T ij ■= {(ft (ft, ft (ft) £ (B l x B 3 )* \ n £ Hist(G)}. 

Let us first assume that there exists a Moore machine that recognises T lJ . Then 
T lJ is actually a function and hence tv ft implies 

ft (ft = T ij (ft(ft) = T ij (ft(ft)) = ft (ft), 
and we can conclude that 7r ft. 

To see that the converse implication holds, notice that T' 13 is a regular relation, 
recognised by the game graph G viewed as a finite-word automaton A*q over the 
alphabet of observation pairs B l x B 3 . Concretely, consider the nondeterministic 
automaton A ^ := (V, B l x B 3 , vo, A, V) on states corresponding to positions of 
G, with initial state vo and with transitions (v, (b l ,b 3 ),v') £ A if there exists a 
move ( v,a,v') £ E such that ft (ft) = b l and ft (ft) = b 3 ; all states are accepting. 
Further, let M l] be the automaton obtained by determinising A l £ and trimming 
the result, that is, removing all states that do not lead to an accepting state. 

Now assume that the game graph G at the outset yields static hierarchical 
information. Then, the relation T 1 ' 3 recognised by M 13 is functional, and hence 
M %3 is deterministic in the input component i: for any state v there exists precisely 
one outgoing transition along each observation 6* £ B l . In other words, M 13 is a 
Mealy machine, which we can transform into an equivalent Moore machine, as 
desired. □ 

Theorem 5 For games with static hierarchical information, the synthesis problem is 
finite-state solvable. 

Proof Intuitively, we transform an arbitrary game graph G = (V, E, ft with static 
hierarchical information into one with hierarchical observation, by taking the syn¬ 
chronised product of G with automata that signal to each player i the observations 
of all players j F i. We shall see that this preserves the solutions to the distributed 
synthesis problem, for any winning condition on G. 

To make the construction precise, let us fix a pair i < j of players, and consider 
the Moore machine M lJ = (M, mo,(i, ft as in the proof of Lemma[T] which trans¬ 
lates the observations ft (ft into ft (ft, for every history n in G. We define the prod¬ 
uct G x as a new game graph with the same sets of actions as G, and the same 
observation alphabets except for player i, for which we expand the alpha¬ 

bet to B l x B 3 to also include observations of player j. The new game is over posi¬ 
tions in V x M with moves ((v, m), a, (v r , m')) if (v, a, v r ) £ E and p.(m, ft(v)) = m!. 
The observations for player i are given by ft(v,m) = (ft(v),v(m)), whereas they 
remain unchanged for all other players ft(v, m) = ft(v), for all k ft- i. 

The obtained product graph is equivalent to the original game graph G, in the 
sense that they have the same tree unravelling, and the additional components in 
the observations of player i (representing observations of player j, given by the 
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Moore machine M lJ ) are already determined by his own observation history, so 
player i cannot distinguish any pair of histories in the new game that he could 
not distinguish in the original game. Accordingly, the strategies on the expanded 
game graph G x M lJ correspond to strategies on G , such that the outcomes of any 
distributed strategy are preserved. In particular, for any winning condition over G, 
a distributed strategy is winning in the original game if, and only if, it is winning 
in the expanded game G x M lJ . On the other hand, the (positional) observations 
of Player i in the expanded game determine the observations of Player j. 

By applying the transformation for each pair i < j of players successively, we 
obtain a game graph that yields hierarchical observation. Moreover, every winning 
condition on G induces a winning condition on (the first component of positions 
in) G x M lJ such that the two resulting games have the same winning strategies. 
Due to Theorem [2j we can thus conclude that, under w-regular winning conditions, 
the synthesis problem is finite-state solvable for games with static hierarchical 
information. □ 


To decide whether a given game graph yields static hierarchical information, 
the collection of Moore machines constructed according to Lemma [TJ for all play¬ 
ers i,j, may be used as a witness. However, this yields an inefficient procedure, 
as the determinisation of a functional transduce r involves an exponential blowup; 
precise bounds for such translations are given bv IWeber and Klernml ( 19951) . More 
directly, one could verify that each of the transductions A relating observation 
histories of Players i,j , as defined in the proof of Lemma [U is funct ional. This 
can be done in polynomial time using, e.g., the procedure described in lBeal et al.l 

(l2000h . 

We can give a precise bound in terms of nondeterministic complexity. 


Lemma 6 The problem of deciding whether a game yields static hierarchical informa¬ 
tion is NLogSpace -complete. 


Proof The complement problem—of verifying that for a given game there exists a 
pair of players i, j that cannot be ordered in either way — is solved by the following 
nondeterministic procedure: Guess a pair i,j of players, then check that i j , 
by following nondeterministically a pair of histories tv tv' , such that tv rp P- 
symmetrically, check that j i. The procedure requires only logarithmic space 
for maintaining pointers to four positions while keeping track of the histories. 
Accordingly, the complement problem is in NLogSp a ce, and since the complexi ty 
class is closed under complementation ( Immermanl ( 19881) : ISzelepcsenvil (Il988l) h 
our decision problem of whether a game yields static hierarchical information also 
belongs to NLogSpace. 

For hardness, we reduce the emptin ess pr o blem for nondeterministic finite au¬ 
tomata, known to be NLoGSPACE-hard ( .Tonesl (1975)), to the problem of verifying 
that the following game for two players playing against Nature on the graph of 
the automaton yields hierarchical information: Nature chooses a run in the au¬ 
tomaton, the players can only observe the input letters, unless an accepting state 
is reached; if this happens, Nature sends to each player privately one bit, which 
violates the condition of hierarchical information. Thus, the game has hierarchical 
information if, and only if, no input word is accepted. □ 
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3.3 Signals and game transformations 

Functions that return information about the current history, such as those con¬ 
structed in the proof of LemmaS] will be a useful tool in our exposition, especially 
when the information can be made observable to certain players without changing 
the game. 

Given a game graph G, a signal is a function defined on the set of histories 
in G, or on the set of observation histories of some player i. We say that a sig¬ 
nal / : Hist(G) — > S is information-consistent for player i if any two histories 
that are indistinguishable to player i have the same image under /; in particular, 
strategies are information-consistent signals. A finite-state signal is one imple¬ 
mented by a Moore machine. Any finite-state signal / : Hist(G) —> E can also 
be implemented by a Moore machine M l over the observation alphabet B l , such 
that that M(- tv) = for every history n. The synchronisation of G with 

a finite-state signal / is the expanded game graph (G, /) obtained by taking the 
synchronised product G x M , as described in the proof of Lemma U In case / is 
information-consistent for player i, it can be made positionally observable to this 
player, without changing the game essentially. Towards this, we consider the game 
graph (G, /*) that expands (G, /) with an additional observation component f l {v) 
for player i at every position v, such that /(• k) = f l {v) for each history 7r that 
ends at v. The game graph (G,/*) is finite-state equivalent to G, in the sense that 
every strategy for G maps via finite-state transformations to a strategy for (G, f l ) 
with the same outcome and vice versa. Indeed, any strategy for G is readily a 
strategy with the same outcome for (G, f l ) and, conversely, every strategy pro¬ 
file s in (G,/*) can be synchronised with the Moore machines implementing the 
signals f l for each player i, to yield a finite-state strategy profile s' for G with the 
same outcome as s. In particular, the transformation preserves solutions to the 
finite-state synthesis problem under any winning condition. 


4 Dynamic Hierarchies 


In this section, we maintain the requirement on the information sets of players 
to be totally ordered at every history. However, in contrast to the case of static 
hierarchical information, we allow the order to depend on the history and to change 
dynamically along a play. Figure 2(a) shows an example of such a situation: at the 
history reaching V 2 , player 1 is more informed than player 2, however, the order 
switches when the play proceeds to position V 4 , for instance. 


Definition 7 A history n in a game yields hierarchical information if the informa¬ 
tion sets {P l ( 7r) | i e JV} are totally ordered by inclusion. A game graph yields 
dynamic hierarchical information if every history yields hierarchical information. 


We first observe that, for every finite game, the set of histories that yield hier¬ 
archical information is regular. We detail here the construction of an automaton 
for the complement language, which will also be of later use. 


Lemma 8 For every finite game graph G, we can construct a nondeterministic finite 
automaton that accepts the histories in G that do not yield hierarchical information. 
IfG hasn players and \ V\ positions, the number of automaton states is at most‘2n 2 \V\ 2 . 




10 


D. Berwanger, A.B. Mathew, M. van den Bogaard 





(a) dynamic hierarchical information 



(b) recurring hierarchical information 


Fig. 2 More patterns of hierarchical information 


Proof Let us fix a game graph G. A history n in G fails to yield hierarchical 
information if there are two players with incomparable information sets at 7r. To 
verify this, we construct an automaton that chooses nondeterministically a pair i, j 
of players, then, while reading the input 7t, it guesses a pair 7 /, tv" of histories such 
that tv' 7 r and tv" rJ> n and updates two flags indicating whether 7 / 76 * it" or 
tv' ^ tv" \ the input is accepted if both flags are set. Hence, a word 7r £ V* that 
corresponds to a history in G is accepted if, and only if, the corresponding history 
does not yield hierarchical information. 0 

In its states, the constructed automaton stores the indices of the two players i, j, 
a pair of game positions to keep track of the witnessing histories tv' and tv" , and a 
two-bit flag to record whether the current input prefix is distinguishable from tv' 
for player j or from n" for player i. Clearly, it is sufficient to consider each pair of 
players only once, hence, the automaton needs at most 4 "( n 2 ~ 1 ) |y| 2 states, that 
is, less than 2n 2 \V\ 2 . □ 

To decide whether a given game graph G yields dynamic hierarchical infor¬ 
mation, we may check whether the automaton described in Lemma [H] accepts all 
histories in G. However, more efficient than constructing this automaton, we can 
use a nondeterministic procedure similar to the one of Lemma[6]to verify on-the-fly 
if there exists a history at which the information sets of two players are incom¬ 
parable: guess two players i,j and three histories 7r n' and n" tv , such that 
tv' tv" and tv' rp tv" . Obviously, the lower bound from Lemma[8]is preserved. 

Lemma 9 The problem of deciding whether a game graph yields dynamic hierarchical 
information is NLoGSPACE-compZefe. 

In the remainder of the section, we show that, under the more liberal condition 
of dynamic hierarchical information, distributed games are still decidable. 

2 Notice that the automaton may also accept words that do not correspond to game histo¬ 
ries; to avoid this, we can take the synchronised product with the game graph G and obtain 
an automaton that recognises precisely the set of histories that do not yield hierarchical infor¬ 
mation. 
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Theorem 10 For games with dynamic hierarchical information, the synthesis problem 
is finite-state solvable. 

For the proof, we transform an arbitrary game Q with dynamic hierarchical 
information into one with static hierarchical information, among a different set 
of n shadow players 1 n', where each shadow player i! plays the role of the 
i-most informed player in the original game, in a sense that we will make precise 
soon. The information sets of the shadow players follow their nominal order, that 
is, if i < j then P l (n) C P J (n). The resulting shadow game inherits the graph 
structure of the orginal game, and we will ensure that, for every history n, 

(i) each shadow player il has the same information (set) as the i-most informed 
actual player, and 

(ii) each shadow player i' has the same choice of actions as the i-most informed 
actual player. 

This shall guarantee that the shadow game preserves the winning status of the 
original game. 

The construction proceeds in two phases. Firstly, we expand the game graph G 
so that the correspondence between actual and shadow players does not depend 
on the history, but only on the current position. This is done by synchronising G 
with a finite-state machine that signals to each player his rank in the information 
hierarchy at the current history. Secondly, we modify the game graph, where the 
shadow-player correspondence is recorded as a positional attribute, such that the 
observation of each player is received by his shadow player, at every position; 
similarly, the actions of each player are transferred to his shadow player. Finally, we 
show how finite-state winning strategies for the shadow game can be re-distributed 
to the actual players to yield a winning profile of finite-state winning strategies 
for the original game. 


4.1 Information rank signals 

For the following, let us fix a game Q with dynamic hierarchical information with 
the usual notation. For a history n, we write for the total order among players 
induced by the inclusions between their information sets at n. To formalise the 
notion of an i-most informed player, we use the shortcut i j to denote that 
i Xtt j and j i\ likewise, we write i F-k j to denote that i j and not j i- 
Then, the information rank of player i on the game graph G is a signal rank* : 
Hist(G) — y N defined by 

rank*(7r) := |{j € N \ j -< n i or ( j < i and j i) }|. 

Likewise, we define the order of player i relative to player j as a Boolean signal 
: Hist(G) —» (0,1} with X*. (n) = 1 if, and only if, i j. 

Lemma 11 The information rank of each player i and his order relative to any player j 
are finite-state signals that are information-consistent to player i. 


Proof We detail the argument for the rank, the case of relative order is similar and 
simpler. 
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Given a game Q as in the statement, let us verify that the signal rank* is 
information-consistent, for each player i. Towards this, consider two histories 7r ~* 
tv' in G, and suppose that some player j does not count for the rank of i at tt, in 
the sense that either i -< n j or (i «,r j and i < j) — in both cases, it follows that 
7r 7r / , hence P J ( 7r ) = p\ tt'), which implies that j does not count for the rank 
of i at n' either. Hence, the set of players that count for the rank of player i is the 
same at tv and at tv’ , which means that rank*(7r) = rank*(7r). 

To see that the signal rank* can be implemented by a finite-state machine, we 
first build, for every pair i,j of players, a nondeterministic automaton A? that 
accepts the histories tv where j -<„ r i, by guessing a history tv' ~* tv and verifying 
that tv' 7 ^ 3 tv. To accept the histories that satisfy i j, we take the product of the 
automata Aj and Aj for i j and j -<■* i and accept if both accept. Combining 
the two constructions allows us to describe, for every player j, an automaton Aj 
to recognise the set of histories at which j counts for rank* (tv) . 

Next, we determinise each of the automata Aj and take appropriate Boolean 
combinations to obtain a Moore machine AT with input alphabet V and output 
alphabet V(N ), which upon reading a history iv in G, outputs the set of players 
that count for rank*(-7r). Finally we replace each set in the output of AT' by its size 
to obtain a Moore machine that returns on input tv e V*, the rank of player i at 
the actual history tv in G. 

As we showed that rank* is an information-consistent signal, we can conclude 
that there exists a Moore machine that inputs observation histories /?* (it) of player i 
and outputs rank*(-7r). □ 

One consequence of this construction is that we can view the signals rank* 
and < l j as attributes of positions rather than properties of histories. Accordingly, 
we can assume without loss of generality that the observations of each player i have 
an extra rank component taking values in N and that the symbol j is observed at 
history tv in this component if, and only if, rank*(7r) = j. When referring to the 
positional attribute A*- at v, it is more convenient to write i A v j rather than A*. 


4.2 No crossing 


As we suggested in the proof outline, each player i and his shadow player, identified 
by the observable signal rank*, should be equally informed. To achieve this, we will 
let the observation of player i be received by his shadow, in every round of a play. 
However, since the rank of players, and hence the identity of the shadow, changes 
with the history, an information loss can occur when the information order between 
two players, say 1^2 along a move is swapped to become 2 -< 1 in the next round. 
Intuitively, the observation received by player 2 after this move contains one piece 
of information that allows him to catch up with player 1, and another piece of 
information to overtake player 1. Due to their rank change along the move, the 
players would now also change shadows. Consequently, the shadow of 1 at the 
target position, who was previously as (little) informed as player 2, just receives 
the new observation of player 1, but he may miss the piece of information that 


allowed player 2 to catch up (and which player 1 had). Figure 3(a) pictures such 
a situation. 
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(a) crossing: 1 -< 2 at V2 to 2 -< 1 at 114 (b) half-step lookahead 


Fig. 3 Eliminating crossings 


We describe a transformation to eliminate the crossings due to switches in the 
information order, such that this artefact does no longer occur. Formally, for a play 
7r in a game, we say that Player i and j cross at stage £ if P 1 {tvi) C P° {Tvf) and 
P J C P z (tv( + i). We say that a game with dynamic hierarchical information 

is cross-free if there are no crossing players in any play. 

Lemma 12 Every game with dynamic hierarchical information is finite-state equiva¬ 
lent to a game that is cross-free. 

Proof Let G be a game graph with dynamic hierarchical information. We define 
a signal for each pair of players i,j that represents the knowledge that player j 
has about the current observation of player i. If this signal is made observable 
to Player i only at histories tv at which i j, the game remains essentially 
unchanged, as players only receive information from less-informed players, which 
they could hence deduce from their observation. Concretely, we define the signal 
A} : V* -» by 

Xj(rv) := {f3 z (v') : v is the last state of some history tv' £ P\iv)}. 

Clearly, this is a finite-state signal. 

Now we look at the synchronised product of G with the signals (A® and 

the relative-order signal < l j constructed in the proof of Lemma[TT] In the resulting 
game graph, the signal value A* (tt) at a history tv that ends at a position w is 
represented by the position attribute A” (w). We add to every move ( v,a,w ) an 
intermediary position u, at which we assign, for every player i the observation 
(A*-(w) : i < w j}. Intuitively, this can be viewed as a half-step lookahed signal 
that player i receives from player j who may have been more informed at the source 
position v - thus the signal is not necessarily information-consistent for player i. 
Nevertheless, the game remains essentially unchanged after adding the signal, as 
the players cannot react to the received observation before reaching the target w, 
at which point the information is readily revealed. On the other hand, along moves 
at which the information order between players switches, the intermediary position 
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ensures that the players attain equal information. The construction is illustrated 


in Figure 3(b) 


For any game Q on G, we adjust the winning condition to obtain one for the new 
game graph, by ignoring the added intermediary positions. As the added positions 
have only one successor, the players have no relevant choice, so any distributed 
winning strategy for the new game corresponds to one for Q and vice versa. In 
particular, for cu-regular winning conditions, the construction yields a game with 
no crossings that is finite-state equivalent to the original game. □ 


4.3 Shadow players 

We are now ready to describe the construction of the shadow game associated 
to a game Q = ( V ., E,/3,W) with dynamic hierarchical information. Without loss of 
generality, we can assume that every position in G is marked with the attributes 
rank*(u) and A*-, for all players i, j according to Lemma [Tl] and that the game 
graph is cross-free, according to Lemma [T2l 

The shadow game Q' = (V U {0}, E', /3', W ) is also played by n players and has 
the same winning condition as Q . The action and the observation alphabet of each 
shadow player consists of the union of the action and observation alphabets of all 
actual players. The game graph G' has the same positions as G, plus one sink © 
that absorbs all moves along unused action profiles. The moves of G' are obtained 
from G by assigning the actions of each player i to his shadow player j = rank* (v) 
as follows: for every move (v, a, v') G E, there is a move (v,x,v r ) G E 1 labelled 
with the action profile x obtained by a permutation of a corresponding to the rank 
order, that is, a 1 = x° for j = rank z (u), for all players i. Finally, at every position 
v G V, the observation of any player i in the original game Q is assigned to his 
shadow player, that is (3'° (v) := /3 l (v), for j = rank*(u). 

By construction, the shadow game yields static hierarchical information, ac¬ 
cording to the nominal order of the players. We can verify, by induction on the 
length of histories, that for every history n, the information set of player i at 7r 
in G is the same as the one of his shadow player rank l (7r) in G'. 

Finally, we show that the distributed synthesis problem for G reduces to the 
one on G' , and vice versa. To see that Q' admits a winning strategy if Q does, let 
us fix a distributed strategy s for the actual players in Q. We define a signal a° : 
Hist(G / ) —» A for each player in Q', by setting a J (n) := s l (n) if j = rank*(7r), 
for each history n. This signal is information-consistent for player j , since, at 
any history 7r, his information set is the same as for the actual player i with 
rank*(7r) = j , and because the strategy of the actual player i is information- 
consistent for himself. Hence, a J is a strategy for player j in G'. Furthermore, 
at every history, the action taken by the shadow player j = rank*(7r) has the 
same outcome as if it was taken by the actual player i in G. Hence, the set of 
play outcomes of the profiles s and a are the same and we can conclude that, if 
there exists a distributed winning strategy for G, then there also exists one for G'. 
Notice that this implication holds under any winning condition, without assuming 
w-regularity. 

For the converse implication, let us suppose that the shadow game Q' admits a 
winning profile a of finite-state strategies. We consider, for each actual player i of G, 
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the signal s* : Hist(G) — > A 1 that maps every history 7r to the action s*(7r) := a J (jr) 
of the shadow player j = rank 1 (7r). This is a finite-state signal, as we can implement 
it by synchronising G with rank*, the observations of the shadow players, and 
the winning strategies er^, for all shadow players j. Moreover, s* is information- 
consistent to the actual player i, because all histories n £ P l (it), have the same 
value rank l (-7r) =: j, and, since a J is information-consistent for player j, the actions 
prescribed by a J (71-) must be the same, for all 7 r £ P J (71-) = P l (7c) . In conclusion, 
the signal s l represents a finite-state strategy for player i. The profile s has the 
same set of play outcomes as <7, so s is indeed a distributed finite-state strategy, 
as desired. 

In summary, we have shown that any game Q with dynamic hierarchical in¬ 
formation admits a winning strategy if, and only if, the associated shadow game 
with static hierarchical observation admits a finite-state winning strategy. The 
latter question is decidable according to Theorem [5] We showed that for every 
positive instance G' , we can construct a finite-state distributed strategy for G. 
This concludes the proof of Theorem 1101 


5 Transient Perturbations 

As a third pattern of hierarchical information, we consider the case where in¬ 
comparable information sets may occur at some histories along a play, but it is 
guaranteed that a total order will be re-established in a finite number of rounds. 

Definition 13 A play yields recurring hierarchical information if it has infinitely 
many prefix histories that yield hierarchical information. A game yields recurring 
hierarchical information if all its plays do so. 

Since the set of histories that yield hierarchical information is regular in any 
finite game, according to Lemma [SJ it follows that the set of plays that yield 
recurring hierarchical information is ca-regular as well. 

Lemma 14 For every finite game, we can construct a deterministic Biichi automaton 
that recognises the set of plays that yield recurring hierarchical information. If G has n 
players and \V\ positions, the number of automaton states is bounded by 2°( n ^ K 

Proof Given a game graph G, we can construct a nondeterministic finite automa¬ 
ton A that recognises the set of histories in G that do not yield hierarchical in¬ 
formation, as in Lemma [8] By determinising the automaton A via the standard 
powerset construction, and then complementing the set of accepting states, we 
obtain a deterministic automaton A ^ with at most 2 2 " ^ states that accepts 
a word 7r £ V* if either 7r 0 Hist(G), or 7r represents a history in G that yields 

p 

hierarchical information. Finally, we take the synchronised product B of A with 
the graph G. If we now view the resulting automaton as a Biichi automaton, which 
accepts an infinite words if infinitely many prefixes are accepted by B , we obtain a 
deterministic automaton that recognises the set of plays in G that yield hierarchi¬ 
cal information. The number of states in B is at most \V\ 2 2n v I , hence bounded 

by2°(" 2 l y l 2 ). □ 
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The automaton construction provides an important insight about the number 
of consecutive rounds in which players may have incomparable information. Given 
a play n on a game graph G, we call a gap any interval [t, t + l] of rounds such that 
the histories of n in in any round of [t, t + (] do not yield hierarchical information; 
the length of the gap is l + 1. The game graph has gap size k if the length of all 
gaps in its plays is uniformly bounded by k. 

Clearly, every game graph with finite gap size yields recurring hierarchical 
information. Conversely, the automaton construction of Lemma 1141 implies, via a 
standard pumping argument, that the gap size of any game graph with recurring 
hierarchical information is at most the number of states in the constructed Biichi 
automaton. In conclusion, a game G yields recurring hierarchical information if 
and only if, the size of a gap in any play of G is bounded by 2°^ n ^ \ 


Corollary 15 If a game yields recurring hierarchical information, then its gap size 
is bounded by 2°^ n ^ \ where n is the number of players and |Vj is the number of 
positions. 

A family of game graphs where the gap size grows exponentially with the num¬ 
ber of positions is illustrated in Figure[4] The example is adapted from lBerwanger and Dovenl 
(2008!): There are two players with no relevant action choices and they can observe 
one bit, or a special symbol that identifies a unique sink position v .. The family 
is formed of graphs (G m ) m > i, each constructed of m disjoint cycles (C r )i< r < m 
of lengths pi,P 2 , ■ ■ ■ ,Pm corresponding to the first m prime numbers, respectively. 

We number the positions on the cycle corresponding to the r-th prime number as 
C r : = {cq,..., c! pr _ 1 }. On each cycle, both players receive the same observation 0. 
Additionally, there are two special positions «oi and uio, that yield different obser¬ 
vations to the players: /3 1 (uoi) = p(vio) = 0 and /3 1 (uoi) = p{v 10 ) = 1- From the 
initial position vo of a game graph G m , Nature can choose a cycle C r with r < m. 

From each position c\ € C r , except for the last one with t = p r — 1, there are is 
a moves to the subsequent cycle position c^ +1 and, additionally, to voi and vio¬ 
la contrast, the last cycle position c r Pr _i has only the first position cq of the same 
cycle as a successor. From the off-cycle positions uoi and uio the play proceeds to 
the unique sink state v» that emits the special observation • to both players. 

Now, we can verify, for each game graph G m , that in every play 7r that proceeds 
only through cycle positions, the information sets of the two players are comparable 
at a prefix history of length t > 2 in n, if and only if, all the first m primes divide 
t — 2; any play that leaves a cycle reaches the sink v», where the information sets of 
both players coincide. Accordingly, G m yields recurring hierarchical information. 

On the other hand, since the product o f the first m primes is expon ential in 
their sum, (for a more precise analysis, see IBerwanger and Dovenl ( 2008T) b we can 
conclude that the gap size of the game graphs G m grows exponentially with the 
number of positions. 

As a further consequence of the automaton construction in Lemma [Til it fol¬ 
lows that we can decide whether a game graph G yields recurring hierarchical 
information, by constructing the corresponding Brichi automaton and checking 
whether it accepts all plays in G. However, due to the exponential blow-up in the 
determinisation of the automaton, this straightforward approach would require 
exponential time (and space) in the size of the game graph and the number of 
(pairs of) players. Here, we describe an on-the fly procedure that yields better 
complexity bounds. 
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Fig. 4 Game with an exponential gap of non-hierarchical information (for better readability, 
the positions uoi, v 10 ■ and v, are multiply represented) 


Theorem 16 The problem of deciding whether a game graph yields recurring hierar¬ 
chical information is PSpace -complete. 

Proof We describe a nondeterministic procedure for verifying that an input game G 
does not yield recurring hierarchical information, that is, there exists a play in G 
such that from some round t onwards, no prefix of length £>t yields hierarchical 
information. By looking at the deterministic Biichi automaton B constructed in 
Lemma m we can tell that this is the case if, and only if, there exists a finally 
periodic play n = rp w e V u such that the run of B on n visits only non-accepting 
states after reading the prefix r and, moreover, it returns to a previously visited 
state when, but not earlier than, reaching the prefix rp. In other words, the run 
on rp induces a lasso in B, and hence, the length of rp is bounded by the number 
of states |V| 2 2 ” ^ in the automaton. 

The idea of the procedure, pictured in Algorithm!]] is to guess such a history rp, 
and to keep track of the states visited in the corresponding run of the automaton B, 
or more precisely, in the powerset construction of the automaton A from Lemma[8l 

Let us first fix a pair of players i. j. Then, every state reachable by A upon 
reading the prefix it of an input word from V u is described by a tuple («, c, w, d) 
consisting of a pair of game positions u, w and two binary flags c, d such that there 
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exist histories n', n" in G that end at u and w, satisfying n 7 /, n ^r", and the 
flags c or d are set if, and only if, n" rfj 1 7 r or 77 ^ 71 -', respectively. Essentially, this 
means that -n, tt" are candidates for witnessing that players i,j have incomparable 
information at some continuation of 7r. Every state in the automaton B on the 
powerset of A records a set Z,; ; - e(Vx {0, l}) 2 of such tuples, each collecting the 
terminal positions of all witness candidates for i, j flagged correspondingly - we call 
such a set a cell. At the beginning, the procedure guesses one cell Z tJ for every pair 
of players i,j (Line[T|); we call such a collection of cells a configuration. The guessed 
configuration stands for a state of B that shall be reached after reading r and to 
which the run returns at rp. Then, starting from the initial configuration, where all 
players just see the initial position vo (Line[3]), the procedure generates successively 
game positions of r while updating the current configuration to simulate the run 
of B on t. The current configuration Z summarises the possible runs of A on 
the prefix of r generated so far, and it is updated for every new game position v 
according to Procedure |Update| Once the configuration Z = Z is reached, the 
procedure enters a new loop. Here, it verifies in every iteration that the current 
configuration indeed contains a witness of incomparability for the input history, 
that is, there exists, in the cell of some pair of players, a state of A in which both 
distinguishability flags are set (Line [8]). Provided the test succeeds, the procedure 
successively guesses the positions of p while updating the current configuration, 
until Z is reached again, in which case the procedure accepts. Apart of the case 
when the test in Line [H] fails, the procedure also rejects by looping. 

Correctness and soundness The procedure mainly requires space to store the con¬ 
figurations Z , Z that collect a set of tuples from (V x (0, l}) 2 for each pair of 
players, hence it runs in polynomial space. To show correctness, we consider the 
sequence 7r of positions v generated in Lines [5] and [9l and argue that it forms a 
history in G and that, at every iteration of the loops in Lines l4l and 1101 the con¬ 
figuration Z contains, in each cell Z^j , precisely the set of pairs u, w of terminal 
positions reachable by candidate witnesses 7 / 7r such that n" 7r, with as¬ 
sociated flags c, d indicating correctly whether n 1 tv" and n" 7r, due to the 
way they are maintained in Lines 0] and [5] of Procedure |Update| Accordingly, if 
the procedure accepts, then there exists a finally periodic play n := rp u in G such 
that the information sets of at least two players are incomparable, in every round 
from t onwards. Soundness follows from the construction of the Buchi automaton 
in Lemma M and the observation that every run of the procdedure corresponds 
to a run of the automaton with the same acceptance status. 

Hardness Finally, we argue that the problem of deciding whether a game graph 
yields recurring hierarchical information is PSPACE-hard by redu ction from the u ni¬ 
versa lity problem for nondeterministic finite automata, shown bv lMever and Stockmeverl 
(1972) to be PSPACE-hard. Given a nondeterministic automaton A = ( Q , £,A,q 0 ,F) 
over an alphabet E, we construct a game graph G for two players with no action 
choices and with observations in B 1 = B 2 = E x {0,1} corresponding to input 
letters for A tagged with one bit. The set of positions in G contains qo and all 
letter-state pairs ( a,q ) £ E x Q. From every state (a, q) and from q = qo, there 
is a move to position ( a',q r ) if ( q,a',q ') £ A. Each position (a, q) yields the same 
observation (a, 0) to both players (the observation at qo is irrelevant). Thus, every 
run of A on a word a £ E* corresponds to a unique history in G where both 
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Algorithm 1: Deciding recurring hierarchical information 


Data: game graph G = (V, E , (3) for n players 

Result: accept if G does not yield recurring hierarchical information 

type cell : subset of (V x {0, l}) 2 

type configuration : matrix of cells (Zij)ia < jc n 

var Z, Z : configurations 

var v, v : positions in V 

var i,j : players in {1,..., n} 


1 guess ( v,Z) // fix target on cycle 

2 v 4— VQ 

3 foreach i,j with i <j do Zij <— {(uo, 0, vo, 0)}; // initial configuration 

4 while (v, Z) ^ ( v , Z) do 

5 guess v G vEA // guess next history state 

6 Z <— Update(w, Z) // follow powerset construction 


7 

8 

9 

10 

11 


repeat 

if f \ Zij D (V X {l}) 2 = 0 then reject; // hierarchical information 

i,j 

guess v G vEA 
Z <— Update(u, Z) 

until (v, Z) = (v,Z) II cycle found 

accept 


players observe the letters of a tagged with 0. In addition, G has two fresh posi- 
tions v a and v' a , for every letter a e A, with observations /3 1 (v a ) = /3 2 (v' a ) = (a, 1) 
whereas /3 1 (t>a) = P 2 (va) = (a,0). Whenever there is a move in G from a position 
v to some position (a, q) with q € F, we also allow a move from v to v a - These 
fresh positions have one common successor, identified by a distinct observations 
to both players (essentially, indicating that the game is over). Clearly, G can be 
constructed from A in polynomial time. 

Now, consider a nontrivial word a € S* and suppose that it admits an accepting 
run in A. At the corresponding history n in G, which yields the letters of a tagged 
with 1 as an observation to both players, the information sets are incomparable, 
because each player considers it possible that the other recieved the last letter 
with a 0-tag. In contrast, if a is rejected, the information sets at the corresponding 
history in G coincide, hence we have hierarchical information. In conclusion, the 
language of the automaton A is universal if, and only if, the constructed game 
graph yields herarchical information. □ 

We can show that the synthesis problem for the class of games with recur¬ 
ring hierarchical information is finite state-solvable, at least in the case when the 
winning conditions are observable. We conjecture that the result extends to the 
general case. 

Theorem 17 For games with recurring hierarchical information and observable tv- 
regular winning conditions, the synthesis problem is finite-state solvable. 

The argument relies on the tracking construction decribed in lBerwanger et aD 
, which reduces the problem of solving distributed games with imperfect in¬ 
formation for n players against Nature to that of solving a zero-sum game for 
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Procedure Update(u, Z) 

Data: new position v, current configuration Z 
Result: successor configuration after observing ft(v) 

1 foreach i, j do 

2 foreach (u,c,w,d) £ Zij do 

3 foreach u' £ uEA,w' £ wEA with and ft J (w') = /A (v) do 

4 c! <— cV ft ■’(u') 7^ ftt (w') II flag witness for j Ai 

5 d! <— d V / 3 *(?/) 7^ ft l (w') II flag witness for i A j 

0 L z; j t-Z! i .U{(u',c',»',(i')} 


return Z' 


two players with perfect information. The construction proceeds via an unravel¬ 
ling process that generates epistemic models of the player’s information along the 

rounds of a play, and thus encapsulates their uncertainty. _ 

This process described as “epistemic unfolding” in the paper ( Berwanger et all 
l201lL Section 3) is outlined as follows. An epistemic model for a game graph G with 
the usual notation, is a Kripke structure 1C = (K. (Qv) v ev> (~ l )i<i<n) over a set 
K of histories of the same length in in G, equipped with predicates Q v designating 
the histories that end in position v £ V and with the players’ indistinguishability 
relations The construction keeps track of how the knowledge of players about 
the actual history is updated during a round, by generating for each epistemic 
model 1C a set of new models, one for each assignment of an action profile a k to 
each history k £ K such that the action assigned to any player i is compatible with 
his information, i.e. for all k,k' £ K with k k' , we have a\ = a\,. The update 
of a model 1C with such an action assignment (a, k )k£K leads to a new, possibly 
disconnected epistemic model ¥f! over the universe 


I\ = {ka k w | k £ K n Q v and ( v , a k , w) £ E }, 
with predicates Q w designating the histories ka k w £ K ', and with ka k w k'a k w' 


of this updated model under the coarsening ~:= (J"_i we obtain the set of 
epistemic successor models of 1C in the unfolding. The tracking construction starts 
from the trivial model that consists only of the initial position of the game G. By 
successively applying the update, it unfolds a tree labelled with epistemic models, 
which corresponds to a game graph G' for two players with perfect information 
where the strategies of one pla yer tran sl ate into distributed strategies in G and vice 
versa. According to ( Berwanger et al. l201ll Theorem 5), for any winning condition, 
a strategy in Q' is winning if and only if the corresponding joint strategy in Q is 


The construction can be exploited algorithmically if the perfect-information 
tracking of a game can be folded back into a finite game. A homomorphism from 
an epistemic model 1C to YC' is a function / : K —> K' that preserves the state 
predicates and the indistinguishability relat ions, t h at is, Q v ( k ) => Q v (_f(k)) and 
k k' =>- f(k) f(k'). The main result of iBerwaneer et ahl (1201if) shows that, 
whenever two nodes of the unfolded tree carry homomorphically equivalent labels, 
they can be identified without changing the (winning or losing) status of the 
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game ( Berwanger et al • 2onl Theorem 9). This holds for all imperfect-information 
games with w-regular winning conditions that are observable. Consequently, the 
strategy synthesis problem is decidable for a class of such games, whenever the 
unravelling process of any game in the class is guaranteed to generate only finitely 
many epistemic models, up to homomorphic equivalence. 

Game graphs with recurring hierarchical information satisfy this condition. 
Firstly, for a fixed game, there exist only finitely many epistemic models, up 
to homo morp h ic equivalence, where the ^'-relations are totally ordered by in¬ 
clusion ( Berwanger et all 120 111 Section 5). In other words, epistemic models of 
bounded size are sufficient to describe all histories with hierarchical information. 
Secondly, by Corollary [15] from any history with hierarchical information, the 
(finitely branching) tree of continuation histories with incomparable information 
is of bounded depth, hence only finitely many epistemic models can occur in the 
unravelling. Overall, this implies that every game with recurring hierarchical infor¬ 
mation and observable winnin g condition has a fin ite quotient under homomorphic 
equivalence. According to ( Berwanger et al.i201ll Theorems 9 and 11), we can con¬ 
clude that the distributed strategy problem for the class is finite-state solvable. 

□ 


We point out that the number of hierarchic epistemic models in games, and 
thus the complexity of our synthesis procedure, grows nonelementarily with the 
number of players. This should not come as a surprise, as the solution applies 
in particular to games with hierarchical observation under safety or reachability 
conditions (here, the distinction between observable and non-observable conditions 
is insubst antial), and it is known th a t already in this ca se no elementary solution 
exists (see IPeterson and Reid ( 197^1 ; I Azhar et al.l ( 200ll) ). 


6 Games and Architectures 

The game perspective focuses on the interaction between players letting their 
individual entity slide into the background. For instance, two players may interact 
locally without observing actions of a third, more remote player. Or, there may be 
independent teams specialised in solving particular tasks independently in response 
to inputs received from a central authority. Such organisational aspects can be 
crucial for coordinating distributed processes, yet they are hardly apparent from 
the representation of the system as a game graph. 


6.1 Monitored architectures 


To interpret our results with regard to processes and the infrastructure in which 
they interact, we wish to illustrate how the game model relates to the stan- 
dard framewor k of distr ibuted architectures of reactive systems introduced by 
IPnueli and Rosneil ( 199nh . Here, we formulate the setting in more general terms 
to allow for a meaningful translation of games into architectures, and for the pre¬ 
sentation of new decidable architectures. In our framework, processes are equipped 
with a local transition structure according to which actions are enabled or disabled 
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depending on previous actions and observations. Most importantly, the commu¬ 
nication structure is not hard-wired as in the classical setting, but instead imple¬ 
mented via a finite-state device that can dispatch information in different ways 
depending on the previous run. 

A process is represented by an automaton V := ( Q,A,B,qo,5 ) on a set Q of 
states, with output alphabet A, input alphabet B , initial state qo € Q, and a partial 
transition function 8 \ QxAxB^Q. The symbols of the output alphabet A are 
called actions and those of the input alphabet B observations. We say that an 
action a £ A is enabled at a state q £ Q, if ( q,a,b ) £ dom(<5) for some b £ B. We 
assume that for every state q £ Q the set act (q) of enabled actions is nonempty, and 
that for each enabled action a £ act(g) and each observation b £ B, the transition 
8 (q, a, b) is defined. 

The automaton describes the possible behaviour of the process as follows: Every 
run starts in the initial state qo- In any state q, the process chooses one of the 
available actions a £ act(g), then it receives an observation b £ B and switches 
into state 5(q,a,b ) to proceed. From a local perspective, the automaton expresses 
which actions of the process are enabled or disabled, depending on the sequence 
of previous actions and observations. From an external perspective, V defines a 
(local) behaviour relation R-p C (A x B )“ consisting of the sequences of action- 
observation symbols in possible runs, which represents how the process may act in 
response to the observation sequence received as input@ In the following, we will 
not distinguish between the automaton V and the defined behaviour relation Tip. 

A black-box process is one with a single state, i.e., the set of enabled actions 
is independent of previous actions or observations. A white-box process is one 
where every state has precisely one enabled action - hence, a Moore machine, or 
a finite-state strategy in the game setting. A program, or strategy for process V is 
a white-box S with behaviour SCP. 

A monitored architecture is represented by two elements: a collection of pro¬ 
cesses V 1 ,... ,V n plus a distinguished black-box process V° called Environment 
on the one hand, and a specification of the communication infrastructure, called 
view monitor, on the other hand. When we refer to process i, we identify all as¬ 
sociated elements with a superscript and write V 1 = (Q l , A 1 , B l ,q l 0 ,5 l ). The set 
of global actions is the product T := A 0 x A 1 x • • • x A n of the action sets of all 
processes, including the Environment. Then, a view monitor is a Mealy machine 
M = {M,r,B,mo,p,u) with input alphabet E and an output alphabet consisting 
of the product B := B 1 x ■ ■ ■ x B n of the observation alphabets of all processes. 

Hence, the monitor M transforms sequences of global actions into a tuple of 
observations, one for each process i £ { 1 ,... n}. The observations of the Environ¬ 
ment are irrelevant, we always assume that B° = {0}. Since the Environement is a 
black box, it is completely specified by its set of actions, which already appears in 
the description of the view monitor. Therefore, (P 1 ,... ,V n ,M) yields a complete 
description of a monitored architecture. 

A global behaviour in a monitored architecture is an infinite sequence of global 
actions and observations n := (ao, bo)(ai,b±), ■ ■ ■ £ (E x such that the ob¬ 
servations corresponds to the output of the view monitor, bt = fi(ao,...at) for 
all t > 0 and, for every process i, the corresponding action-observation sequence 

3 Such automata are called synchronous sequential transducers or nondeterministic gener¬ 
alised sequential machines in the literature. 
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(a,Q, bo)(a[, b\)... represents a behaviour in V 1 . We refer to the sequence of aoa± ... 
of global actions in a global behaviour as a (global) run. 

A distributed program is a collection S = (S 1 , ..., S n ) of programs, one for every 
process. A global behaviour n is generated by a distributed program S , if for every 
process i, the action-observation sequence (ag, bg)(a\, b\)... represents a behaviour 
in S i C V\ 

The run tree of a distributed program S is the set T$ C r* of prefixes of global 
runs generated by S. Properties of runs are described by a linear-time or branching¬ 
time specification, given by an ca-word automaton or an w-tree automaton over F, 
respectively. A run tree T satisfies a linear-time specification if all branches are 
accepted, and it satisfies a branching-time specification, if the tree T is accepted by 
the specification automaton. We say that a distributed program S is correct with 
respect to a specification if the generated run tree T$ satisfies the specification. 
Given an architecture together with a specification d>, the distributed synthesis 
problem asks whether there exists a distributed program S that is correct with 
respect to the specification $. 


6.2 From architectures to games and back 


A monitored architecture ("P 1 ,... V n , Ai), can be translated into a distributed 
game G = ( V , E, 0) for n players as follows. The set V of positions is the prod¬ 
uct B x M x Q 1 x x Q n of the global observation space with the state sets 
of the view monitor and of all processes (excluding the Environment), with ini¬ 
tial position (&o, mo, qg,... ,qg) f° r some (irrelevant) initial observation. There is a 
move ((&, m, q),a, (b 1 , m!, q')) £ E, whenever the components are updated correctly, 
that is, the global observation b’ = u(m,a), the memory state of the view monitor 
m! = p(m, a), and the local control state q’ 1 := 6(q l , a 1 , b' 1 ) for every process i. (No¬ 
tice that the observation at the source does not matter). Finally, the observation 
function for each player i assigns (3 l (b,m,q) := b l . 

Every prefix of a global run in the architecture induces a unique run prefix 
in the view monitor M. and thus a history in the associated game, such that any 
program s z for a process i corresponds to a strategy for player i and every speci¬ 
fication induces a winning condition expressible by an automaton over the states 
of M. Further, the outcome of a distributed finite-state strategy induces the run 
tree generated by the corresponding distributed program, hence every distributed 
programs that satisfies the specification correspond to winning strategy, and vice 
versa. 


In this paper, we only considered ga mes wit h li near-t i me w inning conditions. 
Nevertheless, the automata technique of lKunferman and Vardil ( 20011) for solving 


games with hierarchical observation (Theorem [2]) also applies to the branching¬ 
time setting. Thus, our decidability results for static and dynamic hierarchical 
information ('TheoremlSland flOl) generalise immediately to branching-time winning 
conditions. 

In summary, we can regard system architectures as a syntactic variant of games, 
modulo the transformation described above. 


Proposition 18 Every instance of the distributed synthesis problem over architec¬ 
tures can be reduced to an instance over games such that the finite-state solutions are 
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preserved. For an architecture ("P 1 , ■ ■ ■, V n , M) and a specification given by a parity 
automaton A$, the reduction runs in time 0( \M\ (|A^| + | V 1 x • • • x V n \)). 


For the converse, to translate a given game graph G = (V, E, (5 ) for n players 
into an architecture, we proceed as follows. For every player i, we first consider 
the localised variant G l = (V 1 , E l , fF) of the game graph, obtained by replacing 
every action label a in a move ( u,a,v ) € E with the component a\ and retaining 
only the observation function ft 1 . Then, we view the resulting one-player game 
graph as a nondeterministic finite-state automaton over the alphabet A 1 x B l , by 
placing the observation symbol b l = fF{v) from the target v of any move ( u,a‘,v ) 
on the incoming edge to yield the transition (u, (a l ,b’),v). Finally we determinise 
and minimise this automaton. The resulting automaton ( Q,A l x B l ,vo,5) may 
not be complete in the second input component, as it is required for a process. 
Therefore, we add a fresh sink z with incoming transitions ( q , ( a l ,b l ),z ) from any 
state q where action a 1 is enabled, but S(q, ( a 1 ,b *)) is undefined. The automaton V 1 
obtained in this way corresponds to a process. 

To construct the associated view monitor, we expand the alphabet A of joint 
actions by adding a set A 0 of Environment actions, which we call directions. Intu¬ 
itively, directions correspond to choices of Nature. Accordingly, we choose A 0 to 
be of size outdegree of G and expand the action profiles in the moves of E with 
directions, such that at any position, if there are two different outgoing moves, 
the direction in their action labels differs. Thus, we obtain a representation of G 
as a deterministic automaton with transition relation p : V x F — > V over the 
expanded alphabet F := A 0 x A 1 x ■ ■ ■ x A 71 . Additionally, we define an output 
function v : V X F —> B that assigns to every position u and every direction-action 
profile a£f, the profile of observations for v = p{u,a). Finally, we consider 

the Mealy automaton M = (V,F,B,vq,p,v) as a view monitor for the collection 
of processes V 1 . 


By way of this translation, the histories in G correspond to prefixes of runs in 
the architecture (P 1 ,..., V n , M ), such that any finite-state strategy for a player i 
corresponds to a program for process i, and the outcome of any finite-state strat¬ 
egy profile s corresponds to the run tree generated by the distributed program 
corresponding to s. Since the game graph is deterministic in the action alpha¬ 
bet expanded with directions, every winning condition induces a specification, 
such that the solutions to the distributed synthesis problem are preserved. The 
specification is obtained as a product of the winning-condition automaton syn¬ 
chronised with the game graph (in the determinised variant with direction labels); 
in addition, any run that reaches a sink is deemed correct. Accordingly, the speci¬ 
fication automaton is polynomial in the size of the game. However, as the process 
automata are constructed by determinisation and minimisation of the localised 
game graph, the overall tr a nslati on involves a necessary exponential blowup (see 
Ivan Glabbeek and Ploeg~erl ( 2008lV ). 


Proposition 19 Every instance of the distributed synthesis problem over games can be 
reduced, in exponential time, to an instance over architectures such that the finite-state 
solutions are preserved. 
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6.3 Pipelines, forks, and hierarchical observation 


The classical setting of lPnueli and Rosned ( 1990h corresponds to the special case 
of architectures where all processes V 1 ,... ,V n are black boxes, and the action 
alphabet A 1 of each process i consists of a product X 1 x Y l of sets of control and 
communication signals. The input alphabets and the communication infrastructure 
are described by a directed graph (({1,..., n}, ->) with arcs i—¥ j expressing that, 
in every round, the communication signal y l emitted with the action a 1 = (x l , y l ) of 
process i is received by process j. Thus, each player i has an observation alphabet 
B l formed by the product of the signal sets Y° of all its predecessors j —y i in 
the graph. In our framework, the communication graph corresponds to a view 
monitor A4 1 with only one state, which works as follows: upon input of a global 
action ((x°, y°), (x 1 , y 1 )..., ( x n , y n )), output the global observation b composed of 
the collection of signals b l := (y J \ j —» j) for each player i £ { 1,... ,n}. 

A pipeline architecture in the Pnueli-Rosner setting is one where t he communi¬ 
cation graph is a directed path 0 —> 1 —> 2 • ■ • —> n . The main result of lPnueli and Rosncrl 
(1990), later extended bv lKupferman and Vardil ( 200l|l . shows that the distributed 
synthesis problem is solvable for pipelines architectures. We prove that games 
with hierarchical observation and regular winning conditions can be translated 
into pipeline architectures with regular specifications such that the solutions to 
the distributed synthesis problem are preserved, thus justifying our statement in 
Theorem [2] as a corollary of the cited results. 


Theorem 20 The distributed synthesis problem for games with hierarchical observa¬ 
tions reduces to the corresponding problem for pipeline architectures. 

Proof Consider a game graph G = (V, E, (3) that yields hierarchical observation 
with a winning condition W C V u given by an automaton. For simplicity, let us 
assume that the information hierarchy among the n players follows the nominal 
order 1 < • • • < n. Accordingly, the observation of each player i > 1 is determined 
by the observation of player i — 1, in the sense that there exists a function f l : 
R* _1 —>- B 1 such that /3 l (v) = / l (/T -1 (r;)) for every position v € V. By defining 
f 1 : R 1 — y B 1 to be the identity, we can hence write /3 l (v) = f l of l ~ 1 o■ ■ •o/ 1 (/ 3 1 (v)). 

First, we transform the given game instance into a monitored architecture 
(fP 1 ,..., V n , M) and a specification <P constructed as in Proposition [T9] As the 
game yields hierarchical information, the output function of the view monitor M 
satisfies i/(m, a) = f z of l ~ 1 o- ■ •o/ 1 (i/ 1 (m, a)), for each player i, every memory state 
m, and global action a. Recall that M is deterministic in the first component of 
its input alphabet, which corresponds to Environment actions (directions). Hence, 
we can write ^( 171 , 0 ) as ^(rcpa 0 ). 

Next, we sequentialise the obtained architecture: For each process i < n, we 
expand the action alphabet A 1 with the observation alphabet B l+1 of the next 
player. Thus, we set A 1 := A 1 x B l+1 and consider the actions in A 1 as internal 
signals and the observations B l+1 as communication signals. The actions of the 
last process carry no relevant communication symbols, we set A n = A n x {0}. Now, 
we modify each process i < n to output in addition to his action a 1 (now an internal 
signal), the observation value f l+1 (6 l ) as a communication symbol - where b l refers 
to the observation received by process i in the previous period (or the default 
initial observation); to implement this, the value f l+1 (b l ) is stored in the state of 
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the process when receiving b* and then added to all outgoing transitions. Let us 
call the modified processes V 1 '. Again, the last process remains unchanged. 

Notice that process V 1 relies on observations ^ 1 (m, a 0 ) rather than just en¬ 
vironment action a 0 . To fix this, we take its synchronised product with At. The 
resulting process MxV 1 inputs only environment actions and updates the monitor 
state internally, to yield the same output ^(m, a 0 ) as V 1 . 

As a view monitor At, we take the st andard monitor for the co mmunication 
graph 0 —> 1 — >2—>■■■—>n, in the sense of iPnueli and Rosnerl ( 1990h . as described 
in the beginning of the subsection. 

To account for the latency introduced by sequentialisation, we adjust the spec¬ 
ification 4? over the original set of global actions F = A 0 x A 1 x ... A n . For every 


infinite word a = aoai • • • G F u formed of global actions at = (cq < 


}) for 


all t > 0, we denote by pipe(a) := a' 0 , a' 2 , • • • G F u the infinite sequence of global 
actions where a[ = (a°, £q +1 ,..., a" +n ), for all t > 0. We can verify that for ev¬ 
ery global run a in the sequence pipe(a) represents a run in the pipeline 

(V, At). Finally, we define the specification := (pipe(a) G r u \ a G W}. 

For the monitored architecture (A'lxP 1 , V 2 , ..., F n ,M) and the specification S, 
every correct distributed program corresponds to a finite-state distributed winning 
strategy in the original game, and vice versa. 

To conclude the reduction, we restrict the specification 4> to the set of global 
runs generated by the processes of this architecture, and replace the processes with 
bl ack boxes, thus obtaining a hard-wired pipeline architecture in the framework 
of iPnueli and Rosnerl ( 199(1) . which preserves the solutions of the distributed syn¬ 
thesis problem for the game G with hierarchical observation from the outset. □ 


Interestingly, the direct translation of pipeline architectures into games does 
not necessarily result in games with hierarchical observation. Consider, for instance 
a pipeline with three processes 1 < 2 < 3, each with a one-bit communication al¬ 
phabet. In the corresponding game, while player receives an input bit from the 
environment, player 2 can play an action that reveals a bit to process 3 which is 
not revealed to player 1, thus the information sets of player 1 and player 3 become 
incomparable. Indeed, the architecture model does not a priori prevent processes 
to emit signals that are independent of the received input. The condition is en¬ 
sured only in the implemented system, where every process follows its prescribed 
program. For arbitrary architectures (already in the Pnueli-Rosner framework), it 
seems hard to incorporate this condition into the process of designing a correct 
distributed program. 

For the case of pipelines, it is easy to work around this circularity. The idea 
is to send the input of every player to all the previous players, as illustrated in 


Figure 5(a) Formally, this amounts to transforming a given architecture A on 
the communication graph ({1,..., n}, —>) by adding feedback links i j from any 
process i to all processes j < i. Clearly, the resulting architecture A! corresponds 
to a game with hierarchical observation. We argue that the distributed synthesis 
problem is invariant under this transformation 


Lemma 21 Every pipeline can be reduced, by adding feedback links, to an architecture 
that corresponds to a game with hierarchical observation and admits the same solutions 
to the distributed synthesis problem. 
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^ 0 ^( 0 ^ 0 ^ 0 ^ -^ 0 — 0 — 0^03 

(a) pipeline with added feedback links (b) two-way chain 


Fig. 5 Pipelines and chains (original links solid, added feedback links dotted) 


Proof Any distributed program for a pipeline architecture A generates the same 
run in the architecture A' with added feedback links (which are ignored), hence 
every solution for A is also a correct distributed program for A! . Conversely, given 
a distributed program S' for an architecture A' with feedback links added to 
a pipeline architecture A as above, we can construct a distributed program S 
for A by considering, for every process i < n, the synchronised product of S" with 
S' l+ 1 x • ■ • XiS /ri ; hence, the current observation of each player j > i is maintained in 
the control state. The program 5* is built from the product automaton, by using 
the observation data from the state rather than the signals from the incoming 
feedback links. Accordingly, S is a distributed program for the pipeline A that 
generates the same runs as S', hence the transformation preserves correctness 
under any specification. □ 


In terms of games, the argument of Lemma [2T] can be rephrased as follows: 
in the game corresponding to a pipeline architecture, the game graph induced by 
any finite-state strategy profile yields hierarchical information. Then, we view the 
output of a hypothetical finite-state strategy (program) of every player i as a finite- 
state signal. Due to the pipeline structure, this signal is information-consistent for 
each receiving player j < i, hence it can be made observable (across the feedback 
links). Thus, we obtain a game with hierarchical observation that is equivalent to 
the one that corresponds to the pipeline at the outset. 

Using this idea, we can recover furt her r esults on decidable arch i tectur es pre¬ 
sented bv lKupferman and Vardil ( 200lh and lFinkbeiner and Schewd ( 20051) . Two- 
way chains, for instance, that is, pipelines where every process i > 1 has an ad¬ 
ditional link to process i — 1 as pictured in Figure 5(b) lead to the same game 
as the underlying pipeline under the transformation of Lemma [2T] The case of 
rings with up to four processes is similar: a ring is a two-way pipeline with an 
additional two-way link between the first and the last process. In a four-process 
ring as pictured in Figure 6(a)[ for any distributed program, process 1 can infer 
the signals emitted by 3 (for 2 and 4), and process 2 the signals emitted by 4. 
Hence, we can add feedback links from 3 to 1 and from 4 to 2 without changing 
the set of runs generated by distributed programs. By doing so, we obtain a game 
with hierarchical observation in the order 1 < 2 « 4 A 3. 

However, two-way rings with five or more black-box processes can in general 
not be transformed into games with hierarchical observation by adding feedback 
links. For any linear ordering < of the processes, the addition of feedback links 
from each process i to all j < i leads to an architecture that allows spurious 
runs, which cannot be generated by distributed programs in the original archi¬ 
tecture. Figure 6(a) illustrates that hierarchical observation cannot be attained 
with the ordering 1<2<3<4<5, for instance, as the feedback link from 5 
to 3 (dashed in the picture) represents a sig nal that is not observatio n -cons istent 
for process 3. Indeed, as pointed out by Mohalik and Walukiewic3 ( 20031) and 
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(a) four processes attain hierarchical 
observation 



not observation-consistent 


Fig. 6 Two-way rings with information-consistent feedback links (dotted) 


iFinkbeiner and Schewel ( 20051) . the synthesis problem for two-way rings with at 
least five black-box processes is undecidable. 

In general, achitectures in the classical framework of Pnueli and Rosner, where 
the communication links are hard-wired, present the following dichotomy: either 
there exists a total ordering A among processes, such that the addition of feedback 
links {i —)• j | j < i} leaves the set of runs generated by any distributed program 
unc hanged, or the architec t ure co ntains an information fork, in the sense defined 
by iFinkbeiner and Schewei ( 2005i) - in the latter case, the authors of the cited 
paper show that there always exists a specification under which the distributed 
synthesis problem is undecidable. 


7 Effective synthesis for monitored architectures 


By relaxing the condition of hierarchical information to include changing or in¬ 
termittent hierarchies, we obtained more general classes of games on which the 
distributed synthesis problem is effectively solvable. How does this generalisation 
carry over to distributed architectures? In the standard framework of architec¬ 
tures with hard-wired communication links, there is little hope for redrawing the 
decidability frontier for distributed synthesis by exploiting patterns of dynamic 
or recurring hierarchical informa tion: According to the information-fork criterion 
of lFinkbeiner and Schewei ( 20051) . any solvability condition on architectures must 
either restrict the specifications, or request that the communication graph is es¬ 
sentially a pipeline. Since hierarchical information —in all considered variants — 
is a property of the game graph, independent of the winning condition, it would 
be unnatural to cast it as a restriction on the system specification. On the other 
hand, solvability of pipeline architectures is already covered by the basic condition 
of hierarchical observation. 

Nevertheless, we encounter situations in practice where the components of a 
distributed system coordinate successfully without being restricted to communi¬ 
cate along the links of a fixed pipeline, or even a fixed communication graph. 
Consider, for instance, a system that operates in time phases, each with a specific 
workflow among component processes that are organised in a pipeline, but just for 
the duration of one phase; in the next phase, the workflow may change and follow 
another pipeline. If, at the end of each phase, all players are updated with the 
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information received in the global system, we obtain a situation corresponding to 
a game with dynamic hierarchical information. Even if the architecture allows any 
two processes to communicate in any direction, the described phase design ensures 
that dynamic hierarchical information is maintained, hence the synthesis problem 
is solvable. Our shadow-player construction can be understood as an instance of 
this idea. Beyond maintaining reconfigurable pipelines, one may furthermore allow 
workflows that propagate incomparable information, as long as this occurs for a 
bounded number of rounds. 


7.1 Hierarchical architectures 

One way to ensure that the synthesis problem is solvable on a class of instances 
is by restricting the architecture so that the corresponding game graph yields 
(static, dynamic, or recurring) hierarchical information. To verify whether a moni¬ 
tored architecture satisfies the condition, we can use the procedures from Lemma[9] 
and Theorem [16] directly, without constructing the corresponding game. Given an 
architecture (P 1 ,..., V n , M) the procedure for dynamic (or static) hierarchical in¬ 
formation runs in space 0(X^=i l°s(l - ^ >1 |) + log(|^V11)), and for the case of recurring 
hierarchical information, it runs in space 0((n| X” =1 V 1 x At|) 2 ). 

In the fixed-architecture framework, solvable classes are characterised by a ba¬ 
sic pattern of the communication graph: a pipeline where no process i is allowed 
to receive signals from any process j < i — 1. This ensures that every run maintains 
hierarchical observation, hence the game graph corresponding to the architecture 
yields hierarchical observation, which implies that the synthesis problem is solv¬ 
able. We can identify similar solvability patterns for monitored architectures by 
restricting to view monitors that never deliver signals which would violate the con¬ 
dition of dynamic hierarchical information. For any architecture equipped with 
such a monitor, the synthesis problem is solvable with respect to every specifi¬ 
cation. Deciding whether a given view monitor M yields dynamic or recurring 
hierarchical information (in the corresponding game graph) with every matching 
collection of processes amounts to deciding whether the architecture formed of 
black-box processes and the view monitor M yields dynamic or recurring hier¬ 
archical information, and can hence be done in logarithmic or polynomial space, 
respectively. 

Thus, we obtain classes of monitored architectures on which the distributed 
synthesis problem is solvable as a direct application of our game-theoretic analysis. 


7.2 Maintaining hierarchical information through strategies 

Our second proposal for automated synthesis in the framework of monitored archi¬ 
tectures relies on enforcing hierarchical information strategically, at the program 
level, rather than restricting the architecture a-priori. In this way, the task of 
avoiding incomparable information is put in the hands of the designer of the dis¬ 
tributed system. We describe an automated method to help the system designer 
accomplish this task. 

Let us first detail our argument in terms of games. Note that, in view of recur¬ 
ring hierarchical information, it is undecidable whether, for a given game, there 
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exists a strategy that is winning and also avoids infinite gaps with incompara¬ 
ble information: this follows by adapting the standard reduction from the Halt¬ 
ing Problem to the synthesis of reachability strategies in two-player games with 
i mper fect information (see, e.g.. IPeterson and Reid ( 1979h . feerwanger and Kaiser! 
(2010)). Therefore, we restrict our attention to dynamic hierarchical information. 


Definition 22 Given a game, a strategy s maintains hierarchical information if 
every history that follows s yields hierarchical information. 

One straighforward, but important insight is that the synthesis problem re¬ 
stricted to strategies that maintain hierarchical information is effectively solvable. 


Theorem 23 For any finite game, it is decidable whether there exists a distributed 
winning strategy that maintains hierarchical information, and if so, we can synthesise 
one. 


Proof Let G be an arbitrary finite game with an w-regular winning condition. 
According to Lemma[8] the set of game histories that yield hierarchical information 
is regular. Let A be a deterministic automaton that recognises this set and consider 
its synchronised product G x A with the game graph G. From this product, we 
construct a new game graph G' by adding a sink position Q with a fresh observation 
to be received by all players, and by replacing all moves ((v,q),a, ( v,q')) in G x A 
where the automaton state q' at the target is non-accepting with ((v,q),a,Q). 
Hence, all histories in G' yield hierarchical information and every history in G 
that does not yield hierarchical information, maps to one in G' that ends at ©. 
Finally, we adjust the winning condition of Q by expanding each play with the 
corresponding run of A and by excluding all plays that reach ©. 

The game Q' constructed in this way yields dynamic hierarchical information, 
hence the synthesis problem is effectively solvable. Moreover, every distributed 
winning strategy in Q' corresponds to a distributed strategy in Q that is win¬ 
ning and maintains hierarchical information in the sense of Definition 1221 and 
conversely, every winning strategy that maintains hierarchical information in Q 
corresponds to a winning strategy in Q'. □ 

The above theorem gives raise to an effective, sound, and incomplete method 
for solving the synthesis problem for monitored architectures: Given a problem 
instance consisting of an architecture and a specification, solve the synthesis prob¬ 
lem for the corresponding game restricted to strategies that maintain hierarchical 
information, and translate any resulting finite-state winning strategy back into a 
distributed program. This approach omits solutions that involve runs which do not 
yield hierarchical information. However, if there exist correct distributed programs 
that also maintain hierarchical information, the procedure will construct one. 

Alternatively, the method based on Theorem [23] can be viewed as a complete 
procedure for solving the synthesis problem on instances with arbitrary archi¬ 
tectures, but with specifications restricted to run trees in which every run (corre¬ 
sponds to a play that) yields dynamic hierarchical information - a regular property 
according to Lemma [8] 
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7.3 Hierarchical routing 

To conclude, we present a concrete example of an architecture design that supports 
the application of our method without otherwise restricting the solution procedure. 

We set out with the observation that the condition of dynamic (or static) 
hierarchical information is a safety condition that can be supervised by the view 
monitor. Our proposal is to incorporate into view monitors the ability to send 
hierarchy-related data to the processes, which can be used by the programs to 
ensure that hierarchical information is maintained whenever possible. 

To illustrate the idea, we consider architectures of a particular format, which 
we call routed architectures. Intuitively each processes can emit signals addressed 
to any other process, and the view monitor, called router , either delivers a signal or 
denies it, according to a deterministic rule. In case of denial, the sending process 
receives a notification. The intention is to maintain hierarchical information on 
every run as far as possible. However, signals sent by the Environment cannot be 
denied, and inter-process signals may also be forced for delivery by the emitting 
process. This may lead to violations of the condition of hierarchical information, in 
which case the monitor sends a panic signal to all processes and henceforth simply 
delivers all emitted signals. 

Formally, a routed architecture for n processes and the Environment features 
action of a special format. Each action has a control and a communication compo¬ 
nent: the control component is a single symbol as in the case of static architectures; 
the communication component is a (possibly empty) list of signals, each formed of 
a body - a single symbol - and a header, which contains the identities i,j of the 
sending and the receiving process, and a priority number. In Environment actions, 
all signals carry top priority (which forces delivery). We assume that no process 
appears twice as a receiver in one action. The observations of each process are 
formed of a communication component, which consists of a list of signals, at most 
one from every other process, and a notification component, which consists of a 
panic flag and a list of delivery flags, one for every other process. 

Priority numbers are used to determine the observations delivered in response 
to a global action. Towards this, the priorities of all signals in the communica¬ 
tion component of an observation are aggregated. The aggregation function is 
monotonous with respect to the inclusion between sets of signals, and each set of 
observations has a unique element of maximum value (to break ties, we may use 
process identifiers). Since the observation space is finite, it is always possible to 
define a priority aggregation function with these properties. 

The router for a given collection of processes is a view monitor that operates 
as follows. There is one sink state called panic state: here the router reads the 
communication components of the global action and delivers to each process i the 
list of all signals addressed to i as a receiver, also setting the panic flag and deliv¬ 
ery flags for each signal emmited by process i. In any other states, including the 
initial one, the router reads the global action a and considers the set of admissible 
observations: a global observation b is admissible if (1) the communication com¬ 
ponent consists of signals emitted in the global action a and contains all signals 
with top priority, and (2) the delivery flags are set correctly, and the panic flag 
is reset, if and only if, the run prefix that would be reached by delivering the 
observation b (corresponds to a play that) yields hierarchical information. If there 
exist admissible observations that do not raise the panic flag, the monitor picks 
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the one of maximal aggregated priority, delivers it to the processes, and switches 
into a non-panic successor state. Else, if all admissible observations raise the panic 
flag, the monitor delivers all received signals and switches into the panic state. As 
the condition of hierarchical observation corresponds to a finite-state signal, the 
described operation of the router can be implemented by a finite-state monitor. 
Essentially, the states store the data needed to supervise the information hierarchy 
ordering. 

In terms of expressiveness, routed architectures lie between hard-wired and 
monitored architectures. As in hard-wired architectures, signals are routed unal¬ 
tered between processes, the difference being that contents and receiver of a signal 
are chosen by the emitting process. In contrast, the signals delivered by an arbi¬ 
trary view monitor to different processes can be highly correlated depending on 
on the global action and the state of the monitor. For routed architectures, the 
correlation is restricted to non-delivery of signals due to violating the condition of 
hierarchical information. 

Given a routed architecture and a (linear or branching-time) specification, we 
can use the incomplete synthesis method presented in Subsection [8] to synthesise a 
distributed program that maintains hierarchical information. If this succeeds, we 
obtain a correct distributed program for which no panic or non-delivery flag will 
ever be raised. 

However, assuming that the given specification is insensitive to signals that 
are not delivered, it is sufficient to synthesise a distributed strategy that avoids 
the panic signal, but allows non-delivery of signals. In game-theoretic terms, this 
corresponds to viewing signalling attempts as cheap-talk actions. The option of 
sending signals that may be denied does not enlarge the class of solvable prob¬ 
lem instances: in principle, each process i can infer from its observation sequence 
that a certain signal will not be delivered, since this can occur only when the 
receiving process is less informed than i - in other words, the non-delivery no¬ 
tification is information-consistent for process i, so it yields no new information. 
Nevertheless, in practice, the warning mechanism against driving the system into 
a non-hierarchical state offers the system designer more freedom in programming 
the processes than the setting where such attempts are forbidden. 

Finally, for problem instances that do not admit a complete hierarchical solu¬ 
tion, our model of routed architectures allows to combine strategies synthesised 
via the method from Subsection [H] applied on the prefix of the run tree on which 
hierarchical information is maintained with other strategies that are triggerred 
when the panic signal is raised. 


8 Discussion 

The task of coordinating several players with imperfect information towards attain¬ 
ing a common objective is quintessential in the design of computational systems 
with multiple interacting components. Known automated methods for accomplish¬ 
ing this task rely on a basic pattern of hierarchical information flow. Here, we 
showed that this pattern can be relaxed considerably, towards allowing the hier¬ 
archy to rearrange in the course of the interaction, or even to vanish temporarily. 

Our technical analysis is based on finite-state games with perfect recall and 
synchronous dynamics. The model has the advantage of exposing the fundamental 
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connection between knowledge and action as put forward bv lMose 1 » ■ Indeed, 
our results on solvable cases for the coordination problem in games rely on the 
key concept of identifying finite-state signals that provide sufficient knowledge for 
triggering winning strategies. 

The game model has shortcomings as well. The assumption of perfect recall 
is rather uncritical, as we finally synthesise strategies implementable by finite- 
state automata. However, the assumption of synchronous dynamics does restrict 
the scope of our results: the uncertainty about the ordering of event occurrences 
in asynchronous systems cannot be captured directly by imperfect information in 
gam es, and different methods are required t o approach the synthesis problem. (See 


eg 


.M adhus udan and Thiagaraianl t'2002'l. G astin et al.1 < 200 91. Muschol l and Walukiewic3 
( 2014h .I Insights on information and coordination in games may shed light on the 
asynchronous setting as well, however, at the current state of research the areas 
appear divided. 

Perhaps the greatest challenge is to make concrete how the game-theoretic re¬ 
sults can help in designing real-world systems. In terms of operat ional models, our 
games are close to the distributed reactive systems of lPnueli and Rosner ( 199 01. 
described by a communication architecture and a regular specification: Instances 
of the synthesis problem can be translated back and forth between the game and 
the system model. Nevertheless, it turns out that games with (static, dynamic, 
or recurring) hierarchial information, do not correspond to natural classes of dis¬ 
tributed architectures. This is because most of the interaction structure described 
by a game graph is incorporated into the specification on the side of the archi¬ 
tecture. While the game graph expresses which actions will be available or not in 
a certain state, the specification expresses which actions should occur or not in a 
run of a correct program. Since the original model of distributed reactive systems 
is too permissive in describing possible behaviours of processes, a meaningful clas¬ 
sification in terms of information flow patterns would need to refer to behaviours 
of correct programs - which we aim yet to construct. 

To overcome the cleavage between game and architecture representations, we 
introduced the model of monitored architectures where the hard-wired communica¬ 
tion graph of the Pnueli-Rosner model is replaced by a transducer that transforms 
global actions into signals sent to the processes, in a dynamic way, depending on 
the previous run. This yields a faithful representation of the information flow in the 
operational model, which allows to translate the decidability results on games with 
hierarchical information. Thus, we obtain rich classes of distributed architectures 
on which the automated synthesis problem is solvable. 

In continuation of our study, we see three directions for further research. One 
promising approach is to refine the classification of solvable classes according to the 
structure of the winning condition (or the specification) rather than considering 
only the set of possible behaviour represented by the game grap h (o r the architec- 
ture). In the classical framework of Pnueli and Rosner, Madhusudan and Thiagaraianl 
( 2001 ) considered specifications that are local , in the sense that they require each 
process to satisfy a condition expressed on the state space of its own automaton. 

The authors show that, under such specification, the class of solvable architec¬ 
tures includes clean pipelines, that is pipelines where the last process receives pri¬ 
vate signals from the environment. Technically, the games corresponding to clean 
pipelines do not yield hierarchical information, nevertheless, the synthesis problem 
can be solved by a straightforward adaptation of the automata-theoretic method 
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of IPnueli and Rosned ( 1990h : iKunferman and Vardil (2001). As another example, 
our method for synthesising distributed programs that maintain hierarchical in¬ 
formation, presented in Subsection , can be interpreted as a complete solution for 
arbitrary architectures with specifications restricted to runs that yield dynamic 
hierarchical information. 

A second concrete direction relies on assessing the condition of hierarchical 
information at runtime rather than considering the raw system model. In terms 
of games, this corresponds to asserting that the game graph induced by every 
strategy yields hierarchical information, although this may not be true for the 
original game. Such an approach would allow to capture functional dependencies 
in runs generated by distributed programs, and lead to even more liberal classes 
of solvable architectures. 

Finally, we may include the view monitor in the synthesis process. So far, we 
modeled the view monitor as a white-box process that represent the available com¬ 
munication infrastructure. However, in practical applications, the infrastructure 
does not need to be fixed, the system designers may be able to configure certain 
of its parameters. On the other hand, as we showed in Subsection 17.31 it can be 
helpful to use the abilities of the view monitor as a global observer to provide the 
processes with signals about the global run, even if they are observation-consistent 
and hence could be deduced locally by the receiver. Therefore, it will seems appro¬ 
priate to consider view monitors as grey-box processes, and synthesise white-box 
implementations using automated procedures. 

At bottomline, we are confident that the quest for effective automated synthesis 
is worth pursuing. To correct the discouraging picture drawn by the fundamental 
undecidability results, it is important to identify natural classes of models on 
which the problem is solvable. We believe that hierarchical information offers a 
convincing example in this direction. 
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